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Summary 


This  report  presents  the  results  of  a  modification  to  a  research  program 
to  acquire  data  on  the  spectral  and  spatial  characteristics  of  natural  vegetative 
terrains  and  to  develop  methodologies  for  the  analysis  of  these  data  as  an  aid 
In  the  design  of  camouflage  patterns  for  field  clothing  and  large  cloth  shelters. 
The  program  has  been  concerned  with  camouflage  for  the  visible  spectrum  and  for 
human  observers. 

Technical  aspects  In  the  development  of  modifications  of  the  existing  soft¬ 
ware  for  data  analysis  are  summarized  and  recommendations  for  further  software 
development  are  presented. 

The  following  tasks  were  addressed: 

1.  Optimization  of  the  clustering  In  the  CIE  1976  (L*a*b*)  color  space  to 
remove  any  bias  Introduced  by  the  highly  efficient  histogram  preprocessing  scheme. 

2.  Evaluation  of  data  separability  parameters  and  a  determination  of  the 
minimum  number  of  color  domains  necessary  to  represent  a  scene. 

3.  Modification  of  the  terrain  analysis  aoftwaro  so  that  the  user  can 
specify  any  rectangular  segment  of  the  original  scene  as  the  area  of  Interest. 

A.  Modifications  of  the  symbol  plotting  software  to  enable  Individual 
domain  plotting. 

All  software  has  been  Implemented  on  the  UNIVAC  1106  computer  located  at  the 
Natick  Research  and  Development  Laboratories. 


PREFACE 


The  reported  work  was  performed  for  US  Army  Natick  R&D  Laboratories  under 
Contract  No.  DAAK60-79-C-0072  (Modification  No.  P00005)  with  Mr.  Alvin  0. 
Ramsiey,  Project  Officer.  The  Dec! log  effort  was  ably  led  by  J.  Richard 
Goldgraben.  This  work  Is  part  of  Project  IL62723AH98,  Clothing,  Equipment, 
and  Shelter  Technology;  Task  AB,  Passive  Countersurveillance  Measures  for  the 
Individual  Soldier, 
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MOO  IF ICAT IONS  TO 

DATA  ACQUISITION  ANO  ANALYSIS  FOR  CAMOUFLAGE  DESIGN 


I.  OVERVIEW 

a.  Background 

i  2  t 

This  report  presents  the  results  of  a  modification  to  a  research  program  '  * 
to  acquire  data  on  the  spectral  and  spatial  characteristics  of  natural  vegetative 
terrains  end  to  develop  methodologies  for  the  analysis  of  these  data  as  an  aid 
In  the  design  of  camouflage  patterns  for  field  clothing  and  large  cloth  shelters. 
The  program  has  been  concerned  with  camouflage  for  the  visible  spectrum  and  for 
human  observers. 

Data  from  vegetative  terrain  bachgrounds  are  acquired  by  photographic  and 
digitization  procedures.  Computer  programs  generate  spectral  reflectance  curves 
for  each  resolution  element  In  the  scene  and  analyze  the  colorimetric  character¬ 
istics  of  the  terrain  In  terms  of  CIE  1976  (L*aAb*'>)  color  space.  Among 
the  outputs  of  the  computer  programs  Is  a  map  showing  the  shape  and  distribu¬ 
tion  of  regions  In  the  scene  possessing  similar  colorimetric  characteristics. 

The  spectral  and  spatial  properties  of  these  regions  are  to  be  used  as  the  basis 
for  the  design  of  three,  four,  or  five  color  domain  camouflage  patterns. 

Terrain  data  were  acquired  for  vegetative  terrains  typical  of  the  temperate 
regions  of  Europe  and  North  America  In  both  the  dormant  and  verdant  state.  Data 


^J.  R.  Goldgraben  and  B,  Engelberg,  Final  Report  on  Data  Acquisition  and 
Analysis  for  Camouflage  Design,  Dec!  log,  Inc.,  Melville,  NY  April  iWV  (becl  log 
Report  No.  236,  Contract’  DAAK60-79-C-0072) , 

2 

J.  R,  Goldgraben  and  B.  Engelberg,  Procedures  for  the  Acquisition  and 
Analysis  of  Terrain  Data  for  Camouflage  tesign .“To lu'me  t.  Software  kanuat, 
becl  log,  Inc.  ,  Melvil  le,  NV,  March  1 9b l  (becl  log  Report  23*»,  Contract 
DAAK60-79-C-0072). 

^J.  R,  Goldgraben  and  B,  Engelberg,  Procedures  for  the  Acquisition  and 
Analysis  of  Terrain  Data  for  Camouflage  tie's  fan.  Volume  2,  ManuaVYor  RKotographl  c 
beta  Acqulsl  tion  and  Him"  Digitization, '  beef  log,  Inc'.,  Welvllle,  NV,  March  I9&1. 
TbecH  og  ReporF  'No 235,  Con  tract  bAA  k60-  79-C-  0072 ) . 
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for  both  front  lighted  direct  solar  illumination  and  solar  illumination  with 
moderately  overcast  sky  conditions  were  obtained. 

This  report  summarizes  the  technical  aspects  In  the  development  of  modlfi~ 
cations  of  the  existing  software  for  data  analysis  and  presents  recommendations 
for  further  possible  software  development.  Full  documentation  of  the  Data 
Processing  end  Analysis  Software  and  of  the  Photographic  Data  Acquisition  and 
Digitization  Procedures  are  contained  In  references  found  on  page  67. 

The  data  processing  and  analysis  software  and  the  modifications  under  this 
phase  of  the  contract  have  been  Implemented  on  the  UNIVAC  1106  computer  located 
at  the  Natick  Research  and  Development  Laboratories. 

b.  Task  Requirements  for  This  Phase 

The  following  tasks  were  addressed  under  this  Contract  Modification: 

(1 )  Optimization  of  Clustering 

In  the  previously  delivered  data  analysts  program,  a  histogram  algorithm 
(HIST)  was  used  as  a  first  step  In  the  clustering  of  the  CIELAB  values  of  the 
scene  pixels.  This  algorithm  Is  highly  efficient  and  greatly  reduces  the 
number  of  computations  that  must  be  performed  by  the  Euclidean  clustering 
algorithm  (GEOM).  This  histogram  process  is  not,  however,  an  optimal  process 
and  the  CIELAB  coordinates  of  the  final  color  domains  can  be  influenced  by  the 
color  coordinate  Increments  used  In  the  hlstograronlng  and  by  the  number  of 
Intermediate  clusters  which  exist  when  the  clustering  process  switches  from 
HIST  to  GEOM.  The  Increment  levels  and  number  of  Intermediate  domains  are 
user-apecl f led. 

It  was  recommended,  therefore,  that  an  optimization  routine  be  added  to 
the  existing  color  domain  clustering  process.  The  CIELAB  values  assigned  to 
the  three,  four,  or  five  color  domains  by  the  existing  histogram  and  Euclidean 
algorithms  are  Inputted  into  the  optimization  routine  which  re-adjusts  the 
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C  t  E  LAB  centroid  values  using  a  nearest  means  Iterative  optimization  algorithm. 
The  algorithm  elmlnates  most  of  the  bias  that  may  have  been  Introduced  In  the 
histogram  clustering  and  thereby  makes  the  spectral  and  spatial  characteristics 
of  the  final  color  domains  Independent  of  the  user's  choice  of  clustering 
parameters. 

(2)  Data  Separabt 1 1 ty 

Each  of  the  scenes  should  be  analyzed  and  the  following  tasks  performed: 

-  Develop  a  metric  which  can  be  used  to  evaluate  the  "goodness"  of  the 
calculated  domain  centroids. 

-  Examine  procedures  for  determining  the  "correct"  number  of  domains 
for  each  scene. 

(3)  Scene  Segmentation 

The  Terrain  Analysis  Software  should  be  modified  so  that  the  user  has  the 
ability  to  specify  any  rectangular  segment  of  the  original  scene  as  the  area 
of  Interest.  In  this  manner  some  unwanted  areas  could  be  removed  (l.e,  sky, 
grass)  and  the  user  could  compare  the  In*ra-scene  variability  by  processing 
selected  segments  of  the  scene. 

(M  Domain  Plotting 

The  symbol  plotting  routine  should  have  the  capability  of  plotting  each 
domain  Individually  as  well  as  producing  a  combined  plot.  The  usefulness  of 
contour  plotting  should  be  examined. 
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2.  OPTIMIZATION  OF  CLUSTERING 


a.  General  Discussion 

OPTIM  Is  the  clustering  optimization  routine.  It  is  designed  to  remove 
some  of  the  clustering  Imperfections  either  Introduced  by  HIST  or  resulting 
from  an  Improper  selection  of  clustering  parameters  by  the  user.  The  actual 
algorithm  for  the  optimization  Is  contained  in  a  subroutine  of  OPTIM  called 
REOOM  (See  Figures  I  and  2). 

The  first  time  OPTIM  calls  REOOM,  REOOM  determines  the  domain  centroid 
closes*  to  each  pixel.  If  the  squared  distance  of  a  pixel  to  the  closest 
domain  Is  less  than  or  equal  to  a  user  specified  number  of  domain  variances, 

REOOM  assigns  the  pixel  to  that  domain.  If  the  pixel  to  domain  centroid 
distance  exceeds  this  value,  the  pixel  Is  eliminated  from  the  clustering  on 
this  and  all  subsequent  Iterations,  REOOM  then  calculates  new  centroids  and 
variances  for  the  domains  based  on  the  assigned  pixels,  calls  OUTPUT,  and 
returns  to  OPTIM. 

OPTIM  now  enters  a  loop  governed  by  a  user-parameter  which  specifies  the 
maximum  number  rf  Iterations  through  the  loop.  REDOM  is  called  in  each  Iteration 
and  reassigns  pixels  to  the  closest  domain  centroids  (which  have  been  recom¬ 
puted  In  the  prior  Iteration)  and  keeps  track  of  the  number  of  changes  In 
pixel  domain  assignments.  REDOM  then  recalculates  domain  centroids  and 
variances  based  on  the  new  domain  assignments,  calls  OUTPUT,  and  returns  to 
OPTIM.  If  the  number  of  domain  assignment  changes  In  an  Iteration,  Is  0,  or 
If  the  maximum  loop  number  Is  reached,  OPTIM  exits  Its  loop. 


FIGURE  I  SUBROUTINE  OPTIM 


FIGURE  2  SUBROUTINE  REDOM 


(b) 


’was  plxeP 
!  I iminatedl 


Update  summing  arrays 


^✓was  plxeP'N. 
'assigned  to  a 
di f ferent  domain 
\ihan  before?  ^ 


Increment  1 
of  changes 


. . 

Write  row  of  domain 
assignments  on  disk 

3 

4 

^  RETURN  J 

FIGURE  2  (CONTINUED) 
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b.  Validation  of  Subroutine  OPT  I H 


The  method  that  was  chosen  to  validate  this  routine  was  to  Intentionally 
select  clustering  parameters  that  produced  Incorrect  results  and  to  see  how 
wall  OPTIM  corrects  for  the  Improper  selections.  Scene  10  (4  Color  Woodland 
Camouflage  Cloth)  was  used  as  data  since  correct  L*a*b*  values  are  known  for 
each  of  the  four  domains. 

The  Improper  clustering  parameters  were  chosen  by  examining  Table  4  of 
the  Final  Report  on  Date  Acquisition  end  Analysis  for  Camouflage  Design  , 

Case  7  was  chosen  and  tha  results  that  were  achieved,  prior  to  running  OPTIM, 
are  repeated  below. 


TABLE  1  "IMPROPER"  CLUSTERING  PARAMETERS  (No  Optimization) 


DOMAIN  # 

#  OF  PIXELS 

L* 

a* 

b* 

1 

23 

25.08 

3.93 

9.88 

2 

10 

31.99 

-10.06 

12.00 

3 

10 

26.11 

2.97 

10.90 

4 

23 

24.98 

2.95 

9.57 

The  above  run  accounted  for 

only  6%  of 

the  total 

number  of  pixels  In  the 

:lzed  scene 

(66  out  of  1131) 

and  only  two  distinct 

domains  were  produced. 

OPTIM  was  then  run  (specifying  a  maximum  of  IS  loops)  end  the  variance 
cut-off  was  sufficiently  large  so  as  not  to  discard  any  pixels  on  the  basis  of 
being  too  far  from  an  obviously  bad  domain  centroid.  OPTIM  needed  only  10  loops 
to  reach  the  situation  where  there  were  no  changes  In  pixel  domain  assignments. 
The  results  of  clustering  after  OPTIM  are  outlined  below. 


See  Reference  I. 
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TABLE  2  "IMPROPER"  CLUSTERING  PARAMETERS  {After  Optimization) 


OOMAIN  I 

I  OF  PIXELS 

L* 

a* 

b* 

1 

398 

25. 64 

3.30 

9.69 

2 

258 

40.93 

-3.26 

16.16 

3 

312 

30.97 

-7.87 

11.58 

it 

163 

19.67 

-.46 

.89 

Thee*  values  can  be  compared  to  the  run  with  "good"  cluttering  parameters 
with  no  optimization  (Table  3). 

TABLE  3  "PROPER"  CLUSTERING  PARAMETERS  (No  Optimization) 


OOMAIN  I 

§  OF  PIXELS 

L* 

a* 

b* 

1 

356 

24.79 

3.10 

9.20 

2 

160 

41.17 

-3.93 

15.66 

3 

218 

31. SI 

-8.05 

12.10 

4 

82 

18.10 

0.33 

-0.01 

An  examination  of  these  three  tables  shows  that  even  when  starting  from 
obviously  Incorrect  domain  centroids,  OPT IM  will  Iteratlvoly  adjust  those 
centroids  until  they  properly  represent  the  scene.  The  minor  difference  be¬ 
tween  the  two  final  results  (Tables  2  end  3  )  can  be  attributed  to  the  fact 
that  the  run  with  Improper  clustering  parameters  was  forced  (by  the  setting 
of  e  very  high  variance  cut-off)  to  consider  every  pixel  In  the  scene. 

An  Interesting  feeling  for  the  workings  of  OPTIM  can  be  obtained  by 
examining  Figure  3,  This  Is  a  graph  of  a*b*  space  and  shows  the  Initial 
four  Improper  domains  and  their  escncletad  L*  values.  The  centroids  of  these 
domains  are  then  shown  to  migrate  to  their  final  values  during  each  Iteration 
OPTIM.  The  final  L*  values  for  the  resulting  centroids  are  also  shown. 


, ,  ... . 
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3.  DATA  SEPARABILITY 

This  section  describes  some  statlsticel  tests  which  were  performed  on 
the  L*a*b*  date.  The  purposes  of  these  tests  were  to  determine: 

-  for  a  given  scene,  what  Is  the  "correct"  number  of  domains? 

-  After  clustering,  how  "good"  are  the  obtained  domains? 

As  will  be  discussed  below,  the  first  question  can  be  answered  only 
as  to  what  Is  the  minimum  number  of  domains  which  adequately  represent  the 
scene.  The  answer  to  the  second  question  Is  approached  by  testing  whether 
the  domains  are  statistically  different  from  one  another. 

For  all  scenes  analyzed,  the  minimum  number  of  domains  required  was 
found  to  be  three,  four,  or  five.  For  the  Woodland  Camouflage  Cloth,  four 
domains  were  required.  In  all  cases,  at  the  minimum  number  of  domains,  they 
represented  statistically  "good"  clustering. 


Number  of  Domains  Required 


The  Statement  of  Work  requires  that  it  be  possible  to  reduce  all  real 
world  scenes  to  five,  four,  or  three  domains.  For  any  given  scene,  It  seems 
appropriate  to  ask  the  question  "How  many  domains  should  there  be?  "  Phrased 
differently,  this  problem  Is  finding  the  "optimum"  number  of  domains. 

Intuitively,  It  would  seem  that  an  analysis  of  the  variances  (squared 
vector  distances  In  L*a*b*  space)  of  domain  and  pixel  L*a*b*  values  could  be 
used  to  solve  this  problem.  Figure  i*  Isa  two  dimensional  plot  of  four 


b* 


0.  through  represents  Domains  I  through  k 

C(  and  Cj  rapresants  tha  Cantrolds  of  Domains  1  and  2 

Pj  and  P2  ara  two  Plxals  within  Domain  1 

I- 1 C'IRE  <4  FOUR  IDEALIZED  DOMAINS 

Not a  that  all  of  tha  pixels  used  to  cluster  to  tha  four  Imaginary  domains 
ara  contained  within  tha  outline  of  tha  domain.  Now,  tha  separation  of  tha 
domains  should  be  related  to  tha  distance  between  the  centroids  (C(  to  Cg, 

C|  to  C^,  etc.)  and  the  "compactness"  of  tha  domains  should  be  related  to 
the  distance  between  plxals  within  a  domain  and  the  domain  centroid  (Cj  to 
Pj,  C j  to  P2,  ate.) 

Tha  vary  use  of  tha  terms,  between  and  within,  suggests  an  Analysis  of 
Variance  where  the  statistical  test  of  the  "goodness"  of  the  clustering  Is 
the  ratio  of  the  between  cluster  variance  to  the  within  cluster  variance. 

This  ratio,  referred  to  as  the  F  ratio,  can  be  used  to  test  a  hypothesis  that 
the  obtained  clustering  arose  from  chance  alone.  If  the  obtained  ratio  Is 
sufficiently  large,  this  hypothesis  Is  rejected  and  it  Is  Inferred  thut  the 
clutters  are,  In  fact,  distinct. 

Before  proceeding  In  this  manner,  a  literature  survey  was  conducted  In 
the  Image  Analysis  area  to  determine  what  alternative  procedures  might  have 
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been  developed.  One  paper  was  found  which  discussed  measures  of  the  "goodness" 
of  clustering.  It  discusses  two  useful  criteria  which  are  defined  as  B^  and 
Bj.,  Is  equivalent  to  the  F  ratio  discussed  above.  Bg  is  the  product  of  the 
between  and  within  variances.  Bg  Is  said  to  be  useful  since,  when  each  pixel 
Is  a  domain,  within  variance  Is  zero,  and  when  all  pixels  are  Assigned  to  one 
domain,  between  variance  Is  zero.  Hence,  In  both  extremes  Bg  Is  zero.  The 
authors  state  that  It  follows  that  the  "best"  clustering  occurs  at  the  number 
of  clusters  which  maximizes  Bg. 

Decllog  could  find  no  logical  basis  for  this  assertion.  Consider  the 
processes  which  occur  In  cluttering  of  a  fixed  number  of  pixels.  If  each  and 
every  pixel  has  a  unique  L*,  a1'1  and  b*,  clearly  the  "best"  characterization  of 
that  scene  Is  n  domains,  where  n  Is  the  number  of  pixels.  It  Is  Irrelevant,  at 
this  point,  that  this  Is  of  no  value  to  camouflage  design.  As  pixels  are 
assigned  to  domains,  and  provided  the  number  of  pixels  per  domain  Is  fairly 
large,  the  between  domain  variance  will  also  increase  as  the  number  of  domains 
decreases.  If  a  "good"  clustering  algorithm,  such  as  OPTIM,  Is  used,  the 
within  domain  variance  will  also  Increase  as  the  number  of  domains  Is  decreased. 
Therefore,  Bg  will  monotonlcal ly  Increase  as  the  number  of  domains  decreases  and 
will  be  a  maximum  at  two  domains. 

Bg  was  calculated  for  each  scene  analyzed  to  date,  and  was  found  to  be 
a  maximum  at  two  domains,  and  to  decrease  with  Increasing  numbers  of  domains. 

It  ts  concluded  that  Bg  Is  not  a  good  criterion.  For  this  reason  (or, 
equivalently,  the  F  ratio)  Is  suggested  for  use  to  evaluate  the  number  of 
dome  I  ns  required,  B^  Is  the  ratio  of  the  between  domain  variance  to  the 
within  dometn  variance.  This  then  CAn  be  a  measure  of  the  extent  to  which 
the  domains  account  for  the  total  scane  varlanca. 

Ona  simply  chooses  the  percentage  of  the  total  variance  In  all  of  the 
pixels  being  clustered  which  he  wants  to  be  accounted  for  by  the  domains.  In 
the  analysis  done  to  date,  90%  of  the  total  variance  has  been  the  criterion. 

®G.  B.  Coleman  and  H.  C,  Andrews,  Image  Segmentation  by  Cluttering, 
Proceedings  of  the  IEEE,  Vol.  67,  No.  5,  Hey  1979. 


One  then  reads  from  a  Table  of  Critical  Values  of  F,  the  90%  values  for  1-n 
degrees  of  freedom,  where  n  Is  the  number  of  domains. 

Figure  5  Is  a  plot  of  B^  for  the  Woodland  Camouflage  Cloth  for  one 
through  seven  domains.  The  dashed  line  It  the  critical  value  at  which  the 
domains  account  for  90%  of  the  total  variance.  If  one  goes  to  the  next  higher 
Integral  number  of  domains  from  the  crossover,  one  sees  the  minimum  number  of 
domains  which  account  for  90%  of  the  total  variance.  Any  higher  number  of 
domains  will  account  for  a  greater  proportion  of  the  total  variance.  It  may 
be  noted  In  this  example,  the  Woodland  Camouflag#  Cloth,  that  four  Is  the 
minimum  number  of  domains  required  to  account  for  90%  of  the  variance. 

Figures  6  through  12  show  similar  plots  for  other  scenes  which  have 
been  analyzed.  Note  that  the  minimum  number  of  domains  Is  threa  (Scene  13) 
end  the  maximum  1s  five  (Scenes  3,  14,  end  21$  Table  4). 

Figures  13  through  20  show  the  L*a*b*  values  for  the  domain  centroids 
for  each  scene  analyzed  to  date,  together  with  a  verbal  dascrlptlon  of  the 
scene.  Figures  13,  14,  and  16  are  dormant  scenes.  Note  that  all  centroids 
are  In  the  "red11  quadrant.  In  Figure  14,  It  Is  believed  that  tha  two  centroids 
below  the  horizontal  axis  may  have  been  caused  by  digitizing  the  sky  background 
between  the  limbs  of  the  dormant  trees. 

Figure  IS  depicts  the  domain  centroids  obtained  from  thn  analysis  of  the 
Woodland  Camouflage  Cloth.  Despite  the  fact  that  there  Is  no  traceability 
between  the  reflectance  targets  used  In  the  photographs  and  NLAB's  colorimeter, 
one  can  observe  the  remarkable  similarity  betwaon  then  values  end  those  mea¬ 
sured  by  NLAB's. 


TABLE  4  s  MINIMUM  NUMBER  OF  DOMAINS 


SCENE  I 

3 

4 
10 
11 
12 

13 

14 
21 


TYPE 

Near 

Far 

Naar 

Near 

Far 

Near 

Far 


_ DESCRIPTION 

Deciduous,  Brush  (Dormant) 
Deciduous,  Brush  (Dormant) 
Woodland  Camouflage  Cloth 
Fruit  Orchard  (Dormant) 
Deciduous,  Brush  (Verdant) 
Coniferous  (Verdant) 
Deciduous,  Brush  (Verdant) 
Deciduous,  Brush  (Verdant) 


MINIMUM  #  DOMAINS 

5 

4 

4 

4 

4 
3 

5 

5 


-  -•  mini mUm  BETA  A 
required 

•)  wi  th  qpt  I  ml  2«t  I  on 
X  wl  thotft  optimization 


minimum  BETA  4 
required 

wl  th  optimization 
without  optimization 


FIGURE  16  Scan*  11  «*■  Domain*  In  a*  v*.  b*  Plan# 
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FIGURE  20  Scene  21  “  Domains  In  a*  vs.  b  Plane 

The  remainder  of  the  scenes  are  either  coniferous  or  deciduous-verdant. 

All  points  are  in  the  "green"  quadrant.  Each  has  a  "black"  domain.  Figure 
17.  for  example,  displays  a  peculiarity  of  some  dec iduous-verdant  scenes. 

Note  the  correlation  among  the  values  of  the  L*,  art,  and  b*  values  of  the 
obtained  centroids,  the  a*  and  b*  components  have  been  referred  to  by 
Ramsley^  as  an  Iso  Hue  line.  In  this  case,  the  value  of  L*  Is  also  correlated 
with  both  the  a*  and  b*  values.  Whether  or  not  this  has  any  significance  has 
not  been  investigated. 

Finally,  Figure  21  is  a  composite  plot  of  the  domain  centroids  for  all 
scenes  analyzed.  Also  shown  is  the  space  enclosed  In  a*,  b*  space  by  the 
centroids  of  the  Woodland  Camouflage  Cloth.  Since  It  seems  intuitively  logical 
that  camouflage  cloth  should  avoid  extremes,  visual  Inspect  Ion  would  Indicate 
that  the  choice  of  L*a*b*  values  for  the  Woodland  Camouflage  cloth  was  good. 

An  interesting  exercise  would  be  to  start  with  these  centroids,  weighted  by 
the  number  of  pixels,  and  cluster  to  four  domains.  A  comparison  could  then 
be  made  between  the  camouflage  cloth  values  and  the  four  domains.  It  Is  pos¬ 
sible  that,  although  only  eight  scenes  have  been  analyzed,  they  are  somehow 
representative  of  the  world  on  which  the  Woodland  Camouflage  cloth  was  based, 

/ 

b.  "Goodness"  of  Domains 

After  clustering  to  the  minimum  number  of  domains,  It  Is  desirable  to 
test  the  quality  of  the  clustering.  As  will  be  discussed  below,  the  statis¬ 
tical  test  compares  the  spacing  between  domain  centroids  with  the  "compactness" 
of  the  domains.  The  purpose  of  the  test  Is  to  determine  If  the  L*a*b*  values 
of  the  domain  centroids  are  statistically  significantly  different.  If  they 
are,  clustering  Is  "good"  because  the  spacing  between  centroids  is  large,  and 
the  domains  art*  "compact".  The  statistical  tests  will  Indicate  the  probability 
that  the  difference  In  L*a*b*  values  of  any  two  centroids  occurred  by  chance, 
as  opposed  to  the  difference  having  arisen  from  real  characteristics  of  the 
scene.  The  results  will  be  discussed  below. 

^A.  0.  Ramsley,  Selection  of  Standard  Colors  for  the  Woodland  Camouflage 
Pattern  ,  US  Army  Natick  Research  and  Development  Laboratories,  September  1901, 

(Technical  Report  NATICK-81/030). 
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Prior  to  this  discussion,  however,  the  practical  slgnl f icance  of  these 
tests  should  be  evaluated.  Due,  at  least  In  part,  to  the  fact  that  very  large 
numbers  of  pixels  are  assigned  to  each  domain,  the  statistical  tests  become  very 
sensitive.  As  a  result,  very  small  differences  In  the  location  of  centroids 
will  be  found  to  be  "statistically  different".  Thus,  if  a  scene  has  been  clus¬ 
tered  down  to  ten  domains,  for  example,  the  statistical  tests  could  likely 
Indicate  that  there  are  ten  "significantly  different"  domains  at  the  confidence 
level  chosen. 

The  actual  distance  between  some  pairs  of  centroids  In  L*a*b*  space  may 
be  small,  even  less  than  1  unit,  which  Is  the  theoretical  visual  difference 
threshold.  Theoretical  Is  underscored  since  It  must  be  remembered  that  It  Is 
based  on  the  1931  standard  observer,  using  the  method  of  paired  comparisons. 

The  threshold  which  Is  applicable  to  camouflage  cloth  when  viewed  under  field 
conditions  Is  not  known,  but  Is  probably  larger  then  I  unit. 

Therefore,  It  should  be  kept  In  mind  that  the  term  "significantly  different* 
or  "different"  refers  to  the  precision  and  separability  of  domains  from  a  sta¬ 
tistical  viewpoint  and  not  to  the  visual  difference  between  domains  as  applied 
to  camouflage  cloth  In  countersurvell lance. 

Student  t  and  Fischer  F  tests  were  run  to  test  the  statistical  significance 
of  the  distance  between  centroids  InCIELAB  space.  The  F  test  considers  alt 
domains  simultaneously  and  compares  their  combined  variance  with  the  spectra! 
variance  in  the  overall  scene.  The  t  test  comperes  distances  between  Individual 
pairs  of  centroids  with  the  pixel  variance  within  the  respective  domains. 

Both  the  t  and  F  tests  are  calculated  and  printed  out  In  subroutine  OUTPUT 
for  all  clusterings  of  ten  and  fewer  domains.  Because  the  t  test  operates  on 
pairs  of  centroids,  multiple  t  tests  had  to  be  performed;  and  In  order  to  keep 
the  confidence  level  within  each  test  high,  an' a  priori  confidence  level  of 
0.99  was  chosen.  This  means  that  If  the  obtained  value  of  t  exceeds  the  criti¬ 
cal  value  or  the  probability  Is  0.01  or  less,  the  difference  is  due  to  change. 

As  It  turned  out,  for  the  scenes  analyzed  to  date,  the  values  of  all  t's  for 
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ten  and  fewer  domains  are  far  greater  than  the  critical  value  for  0.99  confi¬ 
dence.  It  Is  probable  that,  In  the  future,  even  higher  confidence  levels  can 
be  used. 

On  the  basis  of  the  t  tests  It  Is  concluded  that,  with  as  many  as  ten 
domains,  each  and  every  domain  Is  different  from  every  other  domain.  Given 
this  separability,  It  Is  Inferred  that  the  clustering  Is  "good". 

Because  the  F  test  compares  each  domain  with  the  distance  to  all  other 
centroids  simultaneously,  a  lower  confidence  level,  namely  0.90  was  chosen 
•  priori.  The  results  of  the  F  tests  confirmed  the  results  of  the  t  tests. 
Again,  with  as  many  as  ten  and  as  few  as  two  domains,  each  domain  was  compact 
as  compared  to  the  separation  between  all  domains.  Based  on  this  conclusion, 
It  Is  again  Inferred  that  the  clustering  Is  "good". 

The  obtained  F  and  t  statistics  for  each  run  analyzed  are  shown  In 
the  Appendix. 
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*».  SCENE  SEGMENTATION 


a.  General  Discussion 

One  of  the  extensions  to  the  software  for  the  Analysis  of  Terrain  Data 
for  Camouflage  Design  was  the  ability  to  process  only  part  of  a  total  scene. 

In  the  original  software  package  the  user  specified  the  number  of  columns  and 
rows  of  the  entire  scene  and  the  entire  area  was  processed.  The  user  Is  still 
required  to  specify  the  number  of  columns  and  rows  of  the  entire  scene  but  also 
to  specify  that  part  of  the  scene  that  Is  actually  to  be  processed. 


As  can  be  seen  from  Figure  22  the  user  Is  required  to  specify  the  lower 
left  hand  and  upper  right  hand  corner  of  the  rectangle  of  Interest.  This  is 
accomplished  by  setting  the  variables  ISEGRI  and  ISEGR2  to  the  starting  and 
ending  row  numbers,  respectively.  The  variables  ISEGC1  and  ISEGC2  are  set  to 
the  starting  and  ending  column  numbers,  respectively.  The  Terrain  Analysts 
Software  then  works  on  this  scene  segment  In  exactly  the  same  manner  as  If  It 
were  the  entire  scene. 

1  ISEGC2 

V  Row  IROWMX 

'  y  Column  ICOLMX 


ISEGRI  ^ 


Row  I 
Column  I 


ISEGR2  > 


i ISEGCI 


FIGURE  22  SPECIFYING  A  SCENE  SEGMENT 


b.  Symbol  Plot*  of  Scene  Segments 


The  modifications  that  ware  necussary  to  Incorporate  the  Ability  to 
process  scene  segments  extend  to  the  plotting  software.  It  seemed  reasonable 
that  the  entire  scene  should  be  outlined  and  the  scene  segment  plotted  at  the 
proper  offsets.  This  allows  easy  comparisons  of  multiple  runs  of  the  same  scene 
as  well  as  a  quick  visual  check  of  the  area  of  Interest.  In  order  to  accomplish 
this  It  was  necessary  to  pass  the  coordinates  of  the  segment.  In  addition  to 
the  size  of  the  total  scene  to  the  plotting  routine  (SYMPIOT). 

An  example  of  this  appears  on  the  following  pages.  Figure  23  Is  a  plot 
of  the  entirety  of  Scene  3  after  being  clustered  to  3  domains.  Arrows  have 
been  hand  drawn  to  Indicate  the  scene  segment  that  was  specified  on  a  subse¬ 
quent  run  of  the  analysis  software.  The  scene  segment  coordinates  that  were 
specified  were  ISEGRI-30,  ISEGR2-60,  ISEQCl-50,  and  ISEGC2-80.  The  resulting 
plot  can  be  seen  In  Figures  24  and  25. 

The  scene  segment  that  appears  In  these  figures  was  not  generated  by  clustering. 
The  option  of  specifying  centroida!  L*a*b*  values  and  then  assigning  pixels 
to  the  closest  domain  centroid  was  used.  This  was  done  so  that  the  segment 
plot  and  the  segment  area  of  the  entire  scene  would  be  Identical.  It  Is 
Important  to  realize  that  If  we  had  clustered  the  scene  segment  the  resulting 
centroids  might  have  been  very  different. 
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FIGURE  23  SCENE  3 
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SEGMENTED,  DOMAIN  2 
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5.  DOHA  I N  PLOTTING 


a.  Individual  Domain  Symbol  Plots 

Tha  composlta  symbol  plot  that  resulted  from  an  analysis  of  Scene  3 
(Dormant,  Deciduous  Brush-Near)  Is  shown  In  Figures  31  and  32.  In  order  to 
give  the  user  a  better  appreciation  of  the  shapes  associated  with  each  of  the 
domains,  the  software  was  modified  to  allow  for  the  plotting  of  each  domain 
separately.  The  Individual  domain  plots  for  Scene  3  can  be  found  In  Figures 
33  through  37  and  for  Scene  13  In  Figures  38  and  42. 

The  user  has  the  option  of  plotting  the  composite  alone,  the  Individual 
domains  alone,  or  both  the  composite  and  Individual  domains.  The  capability 
can  be  combined  with  scene  segmentation  and  example  results  for  Scene  3  are 
shown  In  Figures  24  through  28, 

b.  Contour  Plotting 

The  usefulness  of  plotting  contours  as  compared  to  the  supplied  symbol 
plots  of  domains  was  examined,  It  Is  felt  that  contour  plotting  of  the  compos¬ 
ite  domain  data  would  not  be  of  any  help  In  the  graphical  representation  of 
domain  shapes.  These  contour  plots  would  be  arbitrarily  misleading.  It  should 
be  pointed  out  that  contour  plotting  Is  not  the  same  as  shape  outlining  and 
this  discrepancy  is  what  creates  most  of  tha  problems.  A  contour  plot  would 
graphically  show  the  slope,  or  rate  of  change,  between  domains.  This  would 
arbitrarily  assign  a  greater  weight  to  a  change  from  Domain  1  to  Domain  5  then 
from  Domain  1  to  Domain  2. 

Even  If  the  contouring  routines  were  used  on  the  composite  data  to  outline 
one  domain  at  a  time,  similar  to  tha  Individual  symbol  plots,  tha  results  would 
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be  misleading.  Consider  Figure  29  as  a  set  of  pixels  with  their  associated 
domain  numbers. 

1111111111) 

1  2  2  2  1  1  1  3  3  3  1 

1  2  2  2  1  1  1  3  3  3  1 

1  2  2  2  1  1  1  3  3  3  1 

1  I  1  1  1  I  1  I  I  I  1 

FIGURE  29 i  EXAMPLE  DOMAIN  DATA 

If  we  were  to  draw  contour  lines  at  levels  1.5  end  2.5*  In  order  to 
outline  Domain  2,  we  would  generate  the  plot  shown  In  Figure  5-2. 


FIGURE  301  CONTOURING  OF  EXAMPLE  DOMAIN  DATA 

The  contour  lines  that  were  handdrawn  were  squared  off  for  simplicity. 

The  resulting  plot,  however,  Illustrates  the  problem  with  contouring.  By 
definition  contouring  assumes  that  there  must  be  a  Level  2  between  the  Level 
1  and  3  areas.  This  It  Indicating  a  faster  rise  on  the  right-hand  tide  at 
compared  to  the  left. 

A  method  that  might  be  of  some  use  Involves  modifying  the  data  file  that 
Is  contoured.  If  the  data  file  Is  edited  so  that  only  one  particular  domain 
Is  left  Intact,  and  all  other  values  are  changed  to  a  background  value,  then 
contouring  would  outline  this  single  domain.  In  this  manner,  by  submitting 
five  versions  (each  of  which  has  only  a  single  domain  and  a  background),  of  the 
original  data  file  to  the  contouring  software,  outlines  of  each  of  the  five 
Individual  domains  could  be  produced. 
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The  Cal  comp  Contouring  Package  available  on  the  UN  I  VAC  1106  has  a  size 
limitation  that  precludes  using  It  on  a  complete  scene.  It  would  probably  be 
more  efficient  to  write  shape  outlining  software  than  to  attempt  to  expand  the 
contouring  package.  Shape  outlining  software  would  not  have  the  false  area 
problems  of  contouring  discussed  above. 
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INGE  GO  PI  SCALE  FACTOR  l  OBJ  CM/PLOT  CH)  J*4*  ...  ao 

^“i«.MiS:4o.  iS’-oo.  i“-;0 

07.60.  03.70.  80.20.  70*30.  71.60. 

NuNTCH  “  20  INCR  =  1 

I N T t. R V  =  G  lOtOCl  =  1 

NEWRMX  =  132  IGEORt  *1 


IN  s  5 
:hRSH  S  324 
:.WCMX  =  120 


SYMBOL 

L* 

A* 

0* 


1 

3b. 74 
8.28 
17.72 


DOMAIN  NUMBER 

2  3 


+ 

27.71 
3  .20 
1 1  .90 


X 

23-10 
0 .33 
0.39 


X 

22.13 
1  .83 
5-04 


16E.OC2  =120 
1SE.0R2  =132 


17  .96 
4.53 
1  .64 


FIGURE  31  SCENE  3  COMPOSITE 


1 16  .90. 

,  90-00. 
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FIGURE 


FIGURE  39  SCENE  13  COMPOSITE 


6.  COMMENTS  AND  RECOMMEhJATIONS 


a.  Scene  Digitization 

It  Is  strongly  recommended  that  additional  scenes  be  processed  and 
entered  Into  the  available  scene  data  base.  This  would  dramatically  Increase 
the  utilization  of  the  current  analysis  software. 

b .  Interactive  Data  Analysis  with  Graphic  Output 

The  current  data  analysis  software  was  written  for  the  NLABS  UN  I  VAC  1)06 
and  oporates  In  a  batch  processing  mode.  Although  this  was  a  reasonable  and 
cost-effective  approach  for  the  Initial  development  phase,  the  needs  of  the 
camouflage  designer  can  best  be  served  In  an  Interactive  environment  with  graphic 
output, 

The  current  analysis  software  was  structured  In  ouch  a  manner  that  It 
could  be  transferred  to  a  minl/mlcro  computing  system  (with  a  16  bit  word  length) 
with  only  minor  changes.  The  POP  11/23  computer  which  is  serving  as  a  controller 
for  the  spectrophotometer  would  be  an  ideal  host  system  for  the  Interactive 
data  analysis  software.  The  difference  In  the  word  lengths  between  the 
UNIVAC  (36  hits)  and  the  POP  11/23  (16  bits)  has  been  analyzed  and  determined 
to  be  no  problem. 

An  Interactive  menu-driven  executive  would  need  to  be  written.  This 
executive  would  be  responsible  for  guiding  a  user  through  a  data  analysis 
session  as  well  as  being  an  Interface  between  the  user  and  the  graphic  output 
devices. 

Alternate  output  methods  should  be  analyzed  to  Increase  the  utility  of 
the  resulting  domain  assignment  plots.  These  alternate  methods  should  Include 
video  display  systems  (both  Black  and  White  and  Color)  with  hard-copy  capability 
as  well  as  additional  pen  plotter  presentations. 
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c.  Alternate  Par.a  Collection  Methods 


The  current  photograph) c/dlgl tl zatl on  procedure  Is  time  consuming,  and 
with  respect  to  some  details,  less  than  optimum.  An  Investigation  of  alterna¬ 
tive  techniques  should  be  performed  and  a  procurement  specification  should  be 
developed. 

As  a  minimum  the  approaches  that  should  be  examined  Include  the  acquisition 
of  a  black  and  white  television  camera,  solid  state  array  cameras,  and  solid 
state  cameras  with  spectral  outputs. 

Systems  should  be  analyzed  for  portability,  reliability,  camera  sensitivity, 
resolution,  signal  to  noise  ratio  through  the  Interference  filters,  and  other 
parameters  still  to  be  determined.  The  goal  should  be  to  provide  a  specification 
forocommerclal ly  available  off-the-shelf  system;  however,  special  purpose 
designs  should  be  examined  for  completeness. 


This  document  reports  research  undertaken  at 
the  US  Army  Natick  Research  and  Develop¬ 
ment  Command  and  has  been  assigned  No. 
MATICK/TR-  WSJ.QLL—  in  the  series  of  re¬ 
ports  approved  for  publication. 
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APPENDIX 


Outputs  of  Terrain  Analysis  Software 
Showing  Statistical  Data 
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